Member activities and quality of tags in a collection of historical photographs in Flickr
نویسندگان
چکیده
There is growing interest in, and an increasing number of attempts by, traditional information providers to engage social content creation and sharing communities in creating and enhancing the metadata of their photo collections to make the collections more accessible and visible. To enable and guide effective metadata creation, however, it is essential to understand the structure and patterns of the activities of the community around the photographs, resources used, and scale and quality of the socially created metadata relative to the metadata and knowledge already encoded in existing knowledge organization systems. This article presents an analysis of Flickr member discussions around the photographs of the Library of Congress photostream in Flickr. The article also reports on an analysis of the intrinsic and relational quality of the photostream tags relative to two knowledge organization systems: the Thesaurus for Graphic Materials and the Library of Congress Subject Headings. Thirty seven percent of the original tag set and 15.3% of the preprocessed set (after the removal of tags with fewer than three characters and URLs) were invalid or misspelled terms. Nouns, named entity terms, and complex terms constituted approximately 77% of the preprocessed set. More than a half of the photostream tags were not found in the TGM and LCSH, and more than a quarter of those terms were regular nouns and noun phrases. This suggests that these terms could be complimentary to more traditional methods of indexing using controlled vocabularies. Introduction Knowledge organization and representation systems (e.g., lists of terms, taxonomies, thesauri, ontologies) traditionally have been essential parts of the information organization and retrieval infrastructure in libraries and museums, and they have now become increasingly important on the Web to support entity and concept identification, semantic annotation, information retrieval, and question answering (e.g., Perez, 2009). Not surprisingly, there has been considerable research on controlled vocabulary and ontology construction, including research identifying quality index terms and on automatic concept and
منابع مشابه
Tag Suggestr: Automatic Photo Tag Expansion Using Visual Information for Photo Sharing Websites
In this paper, we propose an automatic photo tag expansion system for the community photo collections, such as Flickr. Our aim is to suggest relevant tags for a target photograph uploaded to the system by a user, by incorporating the visual and textual cues from other related photographs. As the first step, the system requires the user to add only a few initial tags for each uploaded photo. The...
متن کاملLearning Landmarks by Exploiting Social Media
This paper introduces methods for automatic annotation of landmark photographs via learning textual tags and visual features of landmarks from landmark photographs that are appropriately location-tagged from social media. By analyzing spatial distributions of text tags from Flickr’s geotagged photos, we identify thousands of tags that likely refer to landmarks. Further verification by utilizing...
متن کاملبررسی مقایسهای تأثیر برچسبزنی مقولات دستوری بر تجزیه در پردازش خودکار زبان فارسی
In this paper, the role of Part-of-Speech (POS) tagging for parsing in automatic processing of the Persian language is studied. To this end, the impact of the quality of POS tagging as well as the impact of the quantity of information available in the POS tags on parsing are studied. To reach the goals, three parsing scenarios are proposed and compared. In the first scenario, the parser assigns...
متن کاملA comparative study of Flickr tags and index terms in a general image collection
concepts (I) 218 (5.24%) 92.39 0.22 (0.64) 25 (0.67%) 96.84 0.03 (0.16) Art historical information (I) 66 (1.59%) 93.98 0.07 (0.29) 88 (2.37%) 99.21 0.09 (0.32) People-related attributes (I) 111 (2.67%) 96.65 0.11 (0.37) 29 (0.78%) 100 0.03 (0.19) Visual elements (P) 75 (1.80%) 98.45 0.08 (0.35) 0 (0.00%) 100 0.00 (0.00) Color (P) 64 (1.54%) 100 0.07 (0.35) 0 (0.00%) 100 0.00 (0.00) Total 4159 ...
متن کاملEu-Social Science: The Role of Internet Social Networks in the Collection of Bee Biodiversity Data
BACKGROUND Monitoring change in species diversity, community composition and phenology is vital to assess the impacts of anthropogenic activity and natural change. However, monitoring by trained scientists is time consuming and expensive. METHODOLOGY/PRINCIPAL FINDINGS Using social networks, we assess whether it is possible to obtain accurate data on bee distribution across the UK from photog...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- JASIST
دوره 61 شماره
صفحات -
تاریخ انتشار 2010